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1 Introduction 

The second-named anthor of this paper has run REU (“Research Experiences 
for Undergraduates”) sites during each summer since 1991. The project titles 
have been The Probability Theory of Runs and Patterns; Discrete Probabil¬ 
ity and Associated Limit Theorems; Probabilistic Methods in Graph Theory, 
Combinatorics and Number Theory and Discrete Random Structures, which 
will give readers an idea of the kind of research that is conducted. During the 
summers of 1999, 2001 and 2003, the program was enriched by the addition of 
an RET (“Research Experiences for Teachers”) supplement that allowed one, 
three, and two teachers respectively to work alongside some of the brightest 
mathematics and statistics majors in the country. The work described in 
this paper was conducted by hrst-named author of this paper, a high school 
teacher in Tennessee, during the summer of 2001 ~ and under the mentorship 
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of the project director. The need to actively involve K-12 teachers in the pro¬ 
cess of mathematical discovery is now beyond debate. The increased level of 
NSF funding for the RET program, and the views of authorities such as A1 
Cuoco, Director of the Center for Mathematics Education at the Education 
Development Center, certainly bear testimony to this fact. We can do no 
better than to echo Dr. Cuoco’s words 0: 

“There are very few absolutes in education, but there’s one thing of which 
I am absolutely certain: The best high school teachers are those who have 

a research-like experience in mathematics.working for an extended period 

of time on a hard problem that has no apparent approach or solution has 
profound effects on how one perceives the nature of the enterprise. Teachers 
who have done this type of research are much less likely to think of math¬ 
ematics as an established body of facts than are teachers who have simply 
taken a set of courses. They are more likely to stay engaged in teaching 
after they start teaching.. And they are much more likely to organize their 
classes around large investigations rather than low-level exercises.... An ideal 
teacher preparation program combines the kind of orchestrated assimilation 
of the main results in mathematics....with the much messier unstructured 
explorations that come from working with a mentor and grappling with a 
research project.” 

Consider a circular pizza. If one makes n straight line cuts in the “stan¬ 
dard” way, we end up with 2n slices; the answer is, of course, different if 
one makes the horizontal and vertical cuts favored by some pizza parlors - 
but the number of slices is still linear in n. What, however, if one seeks, 
at the expense of the regularity of the shape of the slices, to maximize the 
number of slices? Key to the solution to this problem is to ensure sequen¬ 
tially that every cut intersects each of the previous ones. This generates a 
total of n^/2 -|- n/2 -|- 1 = 1 + n + {^ slices with n cuts ([7], j2], jl], 0, 
i); see Figure 1 below for the case n = 3. The maximal number of slices is 
thus quadratic in n. It has been our experience that even elementary school 
children enjoy a classroom activity based on this fact, and particularly the 
challenge associated with the successful drawing of a correct solution for n 
even as small as 5. 


2 



Figure 1 


Both aspects of our investigation into this problem are statistical in na¬ 
ture. In Section 2, we consider the simplest case of n = 3 and ask a very 
simple question. The nature of our process guarantees that the maximal 
solution will not lead to slices of equal size. How might one make the cuts, 
however, so that the slices are of as equal size as possible? In other words, 
if we denote the areas of the slices of a unit pizza by Hi,..., H 7 , how should 
the cuts be made so as to minimize V{Hi,..., H 7 }? The surprising answer is 
revealed in what we affectionately labeled the “Every Seventh Child Starves” 
principle, which dictates that the variance is minimized in a class of solutions 
when three “regular”, i.e., radial cuts are made, leading to six slices of area 
7 r /6 and one of size 0. This is clearly unacceptable from an ethical viewpoint 
and we next investigate which other measures might be used instead so as to 
give a more acceptable solution. 

In Section 3, we investigate what happens if n cuts are made so as to 
potentially maximize the number of slices, but with a blunt knife - so that 
the actual number R = Rn of slices is a random variable whose behavior 
is determined by p = pn, the probability of any cut being successful. We 
compute E(i?) and Y{R) and use Stein’s method of normal approximation 
to prove that a normalized version of R is close to normal in distribution as 
n ^ 00 . 

The research in this paper is characterized by three important pedagogi¬ 
cal facts: First, the problem emanates as an attractive pizza-cutting problem 
that we have successfully used as an “emerging pattern problem” as early 
as at the second grade level. Second, the maxim that “minimum possi¬ 
ble variance” is equivalent to the data points being “as equal as possible” 
is used, together with standard high-school AP-level calculus, to solve the 
corresponding “fair division” problem with three cuts. We have presented 
this solution to students in our Mathematical Statistics and Calculus classes. 
Also, graph paper and rulers have been used to empirically guess the answer 
to the minimum SD problem at an even lower level - with each student in 
a large elementary statistics class drawing the appropriate picture haphaz¬ 
ardly, thus creating his/her own data set with seven numbers, and Ending 
the resulting SD - and with the results of the class being pooled. Last but 
not least, routine calculations at a mathematical-statistics level are used in 
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this paper, in conjunction with a ready-to-use version of Stein’s method of 
normal approximation for sums of dependent random variables, to solve the 
random division problem with n cuts, n —> cx). Portions of this material have 
been used to create non-standard classroom examples, of the computation of 
moments, for upper level students. 

The material in this paper is similar in spirit, but quite different in con¬ 
tent, from that in P 

2 Fair Division: When is Standard Deviation 
Inappropriate? 

Figure 1 illustrates the general conhguration we believe will direct us to a 
solution. We wish to determine a function for the area of each region as a 
function of the length of arc AB, denoted by x. A few assumptions hrst: 

1) The triangle in the center is equilateral with its center coinciding with 
the center of the circle, denoted by O. As a result, the chords are congruent 
and an obvious symmetry emerges with regions ABE = HDF = CGI and 
AEFH = BEGI ^ DFGC] 

2) For convenience the circle has a radius of one unit. For discussion pur¬ 
poses we will refer to the three types of regions as the triangle, the “circular 
triangles”, and the “circular trapezoids”; 

3) X lies in the interval [0, |]. 

Given our assumptions, let’s determine a function for the area of each 
of the three types of regions as functions of x, the length of arc AB. This 
calculation involves simple trigonometry but is surprisingly and unexpectedly 
non-trivial. We have 



'A'TT T* 

mZBEO = —; mZEOB = and mZEBO 
6 ’ 2 ’ 


71 X 


6 2 


Also, the law of sines yields, since OB = 1, 


1 EB 


thus EB = 2 sin —, 
2 ’ 


X 


sin ^ sin f ’ 
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Also , the area of sector AOB equals | and thus the area of a circular triangle 
equals | — 2 sin (|) sin (f — f) • Next, the area of the triangle is computed by 
noting that its altitude equals 3 sin (f — f ) , and that \FG = sin (f — f) • 

It follows that the area of the triangle equals 3v^sin^(| — Finally, the 
area of circular trapezoid is determined by subtraction to equal 



Our three area functions will now be dehned as follows: Area of the triangle 
= «!, area of each circular triangle = 0:25 and area of each circular trapezoid 
= 0:3: 



ANALYSIS OF THE PROBLEM [Note: The calculations and graphs 
from this point were done using the MAPLE software package. Much of the 
“standard” calculus involved can be done by hand, but is rather tedious.] 
Since (Ai + A 2 + ... + Aj)/7 = ti/7, the standard deviation function is 
easily seen, on using the identity sinn = cos(7r/2 — u), to be given by 



5 




Figure 2 

Standard Deviation Function 

The standard deviation function is strictly decreasing (but with an in¬ 
teresting concavity behavior), resulting in the minimum occurring at the 
rightmost endpoint of the interval on which x, the length of the arc, is de- 
hned. See Figure 2 above. The geometric interpretation of the result is that 
the area of the triangle is 0, leaving the remaining six regions to each have an 
area of With six people getting the same size piece of pizza and one person 
receiving no piece, we refer to this as the “every seventh child starves” prin- 
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ciple. Having one child starve is, for us, an unacceptable solution, exposing a 
limitation of the variance measure in this case. So let’s consider another pos¬ 
sible measure - the mean absolute deviation. The mean deviation function 
is given by: 


md(a;) 



3^3 


■ cos 



The graph of the absolute deviation function is shown in Figure 3. The 
absolute minimum on the interval [0, |] occurs when the arc length is ap¬ 
proximately .96976, resulting in the area of the triangle being a rather small 
0.00779, with each circular triangle having an area of 0.44880, and the cir¬ 
cular trapezoids each having area 0.59581. This result is not particularly 
more satisfying than the fact that minimum SD leads to the “starving child 
syndrome.” Looking at the graph there is also a local minimum. Explor¬ 
ing this point we hud that this value occurs with an arc length of 0.45061 
yielding areas for the triangle, circular triangle, and circular trapezoid of 
0.44880, 0.09399, and 0.80361, respectively. But this fact is irrelevant since 
local minima have no particular signihcance. 
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Figure 3 

Absolute Deviation Function 


Still not satisfied, let’s explore a third function. Suppose we seek to 
maximize the size of the smallest piece, subject to the cuts being of the 
required kind. Figure 4 below shows the three area functions graphed on the 
same set of axes, and it is clear from that diagram that our criterion is met 
when the triangle and circular triangles each have an area of ~ 0.2 and the 
circular trapezoids are each of area ~ 0.78. This is the solution in which 
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three people will get equal amounts and four people will get equal, but lesser 
amounts. We consider this to be our most equitable solution; readers will 
undoubtedly think of other criteria that might be used in this context. 



Figure 4 

The Three Area Functions 
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3 Random Division and Stein’s Method 


Proposition 1 If n cuts are independently made in a pizza, each with a 
probability p of success, and so that each cut intersects each prior successful 
cut, then the expected number of regions is given by 

E{R) = I+ np+ (^P^- 


Proof The number of regions formed by n cuts is given by 1 + (”) + (”) or 
1 + n + . Since any given cut will be made with probability p and not 

made with probability 1 — p, the random variable X, denoting the number of 
cuts made, will possess a binomial distribution. Thus the number of regions 
will be 


so that 


E{R) = 1 + E(X)+E ^ 


= l + np + ^ 


x{x-l) 


X 


= 1 + np — p^nin — 1) -- 

9 ^ I rr. — 


2 )! ^x-2^n-x 


{x — 2)\{n — x)\ 


^ nin — 1) 2 

= 1 + np H---p 


= 1 + np + 


n 


p 


as claimed. 

Lemma 2 


E(i?^) = (1 + 3np +-n(n — l)p^ + 2n(n — l)(n — 2)p^ 
+ ^n(n — l)(n — 2)(n — 3)p"‘) 


Proof We omit the details. We use the fact that (i) E(Z^) = E(Z(Z — 1)) — 
E(Z), and (ii) R — 1 = X + X{X — l)/2, as well as formulas for the third 
and fourth moments of the binomial distribution. 
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Theorem 3 Assuming that np ^ oo as n ^ oo, Y{R) = n^p^{l — p){l + 

0 ( 1 )). 

Proof 


Y{R) 


E{R^) - {E{R)'f 

+ 3np + -^^(n — l)p^ + 2 n(n — l)(n — 2 )p^ 

+ ^n{n — l)(n — 2 )(n — 3)^"^^ 

— + 2np + n{n — l)p^ + {npY + n^{n — l)p^ + -n^(n — l)^p^ 

1 1 
np + -n(5n — 7)p^ + n{n — l)(n — 4)p^ — -n(n — l)(2n — 3)p^. 


If npn = np —i> oo as n —>■ cx) and if p tends to a non-zero constant, then 
the last two terms above dominate the variance, and the assertion follows. If 
p — 0 , the third term dominates and the conclnsion still holds. 

Next consider a 3 dimensional object snch as a watermelon. We know 
that for n cnts there will be a maximnm of 1 -|- n -|- ( 2 ) + ( 3 ) pieces jH]. As 
with the pizza, there are n people to make a cnt in the watermelon, where 
each cnt will independently be made with probability p. Thns 

= 1 + X + where A' ~ p); 

2 0 


^3d(^) — 1 + 1B(A) + IE 


/.Y(.Y - 1) 




/ A(A-lK.Y-2) 


n(n — 1) 


= 1 -|- np H-^--p^ H—nfn — l)(n — 2 )p^, 

6 


after doing a calcnlation similar to the one above. Also, 

VsniR) = E(R^)-(E(R)f 

9 


/ 9 25 

= (1 + 3np -I —n(n — l)p^ -I -n(n — l)(n — 2)p' 

V 2 6 


25 

-I-—n(n — l)(n — 2)(n — 3)p^ 
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+—n{n — l){n — 2){n — 3)(n — 4)p® 

+—n(n — l)(n — 2)(n — 3)(n — 4)(n — 

36 / 

/ 2 

— f 1 + 2np + n{2n — l)p^ -\ — n{n — 1)(2n — l)p^ 

V 3 

H-n^(n — l)(7n — ll)p^ H—n^(n — l)^(n — 2)p^ 

12 6 

+ ^n^{n-lf{n-2fp^^ 

= np+ -n{5n — 7)p^ + -n(n — l)(19n — 50)p^ 

2 6 

+-n(n — l)(3n^ — 19n + 25)p"^ 

+^n(n — l)(n — 2){v? — lln + 20)p^ 

— -^n{n — l)(n — 2)(3?7,^ — ISn + 20)p® 

if np —>■ oo. 

Continuing the pizza/watermelon analogy, let’s consider a d-dimensional 
“hypermelon” maximally cut by n hyperplanes. It has once again been shown 
by Steiner [Zj that there would exist Yli=o (i) regions formed jH]. Other 
approaches to the proof of this may be found in 0) 0) 0) and [HI. If 
each of the n possible cuts are again successfully made independently with 
probability p, it may be shown as above that the expected value for the 
number of regions created in the d-dimensional “hypermelon” is given by: 


EdD(-R) — 




p 




of course, in hindsight, the expected value calculation in each dimension fol¬ 
lows from the fact that the rth factorial moment of a binomial (n, p) random 
variable equals n{n — 1)... [n — r + l)p”. 

We now investigate the nature of the concentration of R around its ex¬ 
pected value in two dimensions: By Chebychev’s Inequality, 

P(\R-E(R)\>\)<^ 
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for every A > 0, so that we see that P(|-R —E(i?)| > A) —0 if A 
(np)3P^l _ p, i.e., if A/((np)3P-y/l — p) oo. As a result we have, for 
p = 1/2, say, that R is concentrated in an interval of length around 

E(P) ~ n^/8 where (j){n) —>■ cx) is arbitrary. To give two other examples, 

hrst, if p = 1/y/n then E(P) ~ n/2 and Y{R) ~ so the concentration 

is in a window of magnitude (f){n)in?/^. Second, if p = 1 — (l/ydi), then 

E(P) ~ n^/2 and V(P) ~ so that the distribution of R is concentrated 
in an interval of size around E(P). 

We next prove a central limit theorem for the distribution of R. Not 
surprisingly, a CLT does not always hold due to the dependencies among the 
indicator random variables involved. However, we are able below to fruitfully 
use a CLT of Rinott jB] to establish the following result: 

Theorem 4 The distribution of R = Rn satisfies, for p{l — p)^/^ ^ 


lim sup 


P 


R - E(P) 

TMW 



<h(ta) 


0 , 


where <h denotes the standard normal distribution function. (This is tanta¬ 
mount to the assertion that a standardized version o/ 1 + X + X(X — l)/2 
is asymptotically normal when X ~ Bi(n,p)./ 

Proof Since X = where Ij = 1 iff the cut is made and equals 

zero otherwise, we have 


R 


1 + X + 


X(X - 1) 


^ 1 

1 + + 2 (A + ••• + 

i=i 


1 ) 


1 H-^-h — (/l + ... + In) (II + ••• + In) 

1 + Jl + ...In + - ^ IJj 


We use a bound given by Rinott [H] using dependency graphs and Stein’s 
Method. Let the + 1 summands that constitute R be the vertices of the 
graph. Call these summands {R : 1 < i < 1} for convenience. Draw an 

edge between two vertices if they are dependent, i.e. if they share at least one 
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common indicator index Then the degree of the summand “1” is zero; 
for each j, the vertex Ij has degree 2{n — 1); and for any i,j, the vertex 
IJj has degree 4(n — 1). Furthermore for any two sets of vertices A and B 
such that there is no edge beween one vertex in A and another in B, the 
sets {Ji : i E A} and and {Ji : i E B} are independent. We thus have a 
dependency graph of + 1 variables each bounded by i? = 1, and having a 
maximum degree of less than D = 4n. Rinott’s Theorem 1.2 applies to give 
us, for each w, 


P 


R - E{R) 


< tc — 


' —CO a/ 27r 


e 2 ^ dz 


< C max 


ND^B^ {ND^B^f/^ DRl 

’ (T / ’ 


where iV, the total number of summands is bounded by a constant times n^. 
Applying Rinott’s Theorem and ignoring constants, 


ND^B^ 




{ND^B^y/^ 




DB 


a 


r? ■ V? 


{rAp^il — p)yR n^Ap9/2(x _ p)3/2 

if 


(n^ ■ 1 


_ p'^ — p) 

1 


0 


if 


n 


1 / 6 - 


n 


0 


fARpAR{\ — pRyR "ndRpAR{\ — pyR 

if 


n 


1/3- 


The hrst of the above three conditions is the strongest of the three, proving 
Theorem 4. This condition is satished if p is hxed or tends to zero so that 
p 3> If however, p is close to one (and in particular approaches 1 as 

n oo) then the (1 — p) term dominates, and we need (1 — p) 3> 
for the result to hold. The cases p = 0,1 correspond to the deterministic 
situations in which the CLT is not valid; Theorem 4 gives bounds on how 
close p may be to either extreme while still giving normal convergence. 
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4 (Open) Problems for Research and Class¬ 
room Discussion 

(i) How do the results of Section 2 generalize for n = 4, 5,...? (ii) Can we 
assert that the solution for n = 3 also minimizes the variance of the pieces 
among all possible choices of 3 maximal cuts, both regular (as considered by 
us) and irregular? 
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